Title: An in vitro method for construction of a DNA library

FIELD OF THE INVENTION

The present invention relates to optimizing DNA sequences
in order to (a) improve the properties of a protein of interest
by artificial generation of genetic diversity of a gene encoding
the protein of interest by the use of the so-called gene- or DNA
shuffling technique to create a large library of "genes",
expressing said library of genes in a suitable expression system
and screening the expressed proteins in respect of specific
characteristics to determine such proteins exhibiting desired
properties or (b) improve the properties of regulatory elements
such as promoters or terminators by generation of a library of
these elements, transforming suitable hosts therewith in operable
conjunction with a structural gene, expressing said structural
gene and screening for desirable properties in the regulatory
element.



BACKGROUND OF THE INVENTION

It is generally found that a protein performing a certain
bioactivity exhibits a certain variation between genera, and
differences may exist even between members of the same species.
This variation is of course even more pronounced at the genomic
level.

This natural genetic diversity among genes coding for proteins
having basically the same bioactivity has been generated in
Nature over billions of years and reflects a natural optimization
of the proteins coded for in respect of the environment of the
organism in question.

In today's society the conditions of life are vastly removed
from the natural environment and it has been found that the
naturally occurring bioactive molecules are not optimized for the
various uses to which they are put by mankind, especially when
they are used for industrial purposes.

It has therefore been of interest to industry to identify
such bioactive proteins that exhibit optimal properties in
respect of the use for which they are intended.

This has for many years been done by screening of natural
sources, or by use of mutagenesis. For instance, within the
technical field of enzymes for use in e.g. detergents, the washing
and/or dishwashing performance of e.g. naturally occurring
proteases, lipases, amylases and cellulases has been improved
significantly by in vitro modifications of the enzymes.

In most cases these improvements have been obtained by
site-directed mutagenesis resulting in substitution, deletion or
insertion of specific amino acid residues which have been chosen
either on the basis of their type or on the basis of their
location in the secondary or tertiary structure of the mature
enzyme (see for instance US patent no. 4,518,584).

In this manner the preparation of novel polypeptide variants
and mutants, such as novel modified enzymes with altered
characteristics, e.g. specific activity, substrate specificity,
thermal, pH and salt stability, pH-optimum, pI, Km, Vmax etc.,
has successfully been performed to obtain polypeptides with
improved properties.

For instance, within the technical field of enzymes the
washing and/or dishwashing performance of e.g. proteases,
lipases, amylases and cellulases has been improved significantly.

An alternative general approach for modifying proteins and
enzymes has been based on random mutagenesis, for instance, as
disclosed in US 4,894,331 and WO 93/01285.

As it is a cumbersome and time consuming process to obtain
polypeptide variants or mutants with improved functional
properties a few alternative methods for rapid preparation of
modified polypeptides have been suggested.

Weber et al., (1983), Nucleic Acids Research, vol. 11,
5661, describes a method for modifying genes by in vivo
recombination between two homologous genes. A linear DNA sequence
comprising a plasmid vector flanked by a DNA sequence encoding
alpha-1 human interferon in the 5'-end and a DNA sequence encoding
alpha-2 human interferon in the 3'-end is constructed and
transfected into a recA positive strain of E. coli. Recombinants
were identified and isolated using a resistance marker.

Pompon et al., (1989), Gene 83, p. 15-24, describes a
method for shuffling gene domains of mammalian cytochrome P-450
by in vivo recombination of partially homologous sequences in
Saccharomyces cerevisiae by transforming Saccharomyces cerevisiae
with a linearized plasmid with filled-in ends, and a DNA fragment
being partially homologous to the ends of said plasmid.

In WO 97/07205 a method is described whereby polypeptide
variants are prepared by shuffling different nucleotide sequences
of homologous DNA sequences by in vivo recombination using
plasmidic DNA as template.

US patent no. 5,093,257 (Assignee: Genencor Int. Inc.)
discloses a method for producing hybrid polypeptides by in vivo
recombination. Hybrid DNA sequences are produced by forming a
circular vector comprising a replication sequence, a first DNA
sequence encoding the amino-terminal portion of the hybrid
polypeptide, a second DNA sequence encoding the carboxy-terminal
portion of said hybrid polypeptide. The circular vector is
transformed into a rec positive microorganism in which the
circular vector is amplified. This results in recombination of
said circular vector mediated by the naturally occurring
recombination mechanism of the rec positive microorganism, which
include prokaryotes such as Bacillus and E. coli, and eukaryotes
such as Saccharomyces cerevisiae.

One method for the shuffling of homologous DNA sequences
has been described by Stemmer (Stemmer, (1994), Proc. Natl. Acad.
Sci. USA, Vol. 91, 10747-10751; Stemmer, (1994), Nature, vol.
370, 389-391). The method concerns shuffling homologous DNA
sequences by using in vitro PCR techniques. Positive recombinant
genes containing shuffled DNA sequences are selected from a DNA
library based on the improved function of the expressed proteins.

The above method is also described in WO 95/22625. WO
95/22625 relates to a method for shuffling of homologous DNA
sequences. An important step in the method described in WO
95/22625 is to cleave the homologous template double-stranded
polynucleotide into random fragments of a desired size followed
by homologously reassembling the fragments into full-length
genes.

A disadvantage inherent to the method of WO 95/22625 is,
however, that the diversity generated through that method is
limited due to the use of homologous gene sequences (as defined
in WO 95/22625).

Another disadvantage in the method of WO 95/22625 lies in
the production of the random fragments by the cleavage of the
template double-stranded polynucleotide.

A further reference of interest is WO 95/17413 describing a
method of gene or DNA shuffling by recombination of specific DNA
sequences - so-called design elements (DE) - either by
recombination of synthesized double-stranded fragments or
recombination of PCR generated sequences to produce so-called
functional elements (FE) comprising at least two of the design
elements. According to the method described in WO 95/17413 the
recombination has to be performed among design elements that have
DNA sequences with sufficient sequence homology to enable
hybridization of the different sequences to be recombined.

WO 95/17413 therefore also entails the disadvantage that
the diversity generated is relatively limited. Furthermore the
method described is time consuming, expensive, and not suited for
automatisation.

Despite the existence of the above methods there is still a
need for better iterative in vitro recombination methods for
preparing novel polypeptide variants. Such methods should also be
capable of being performed in small volumes, and amenable to
automatisation.

SUMMARY OF THE INVENTION

The present invention concerns briefly the utilization of
template shift of a newly synthesized DNA strand during in vitro
DNA synthesis in order to achieve DNA shuffling. By using this
technique it is possible to obtain such results in a more
expedient manner, and to some extent even a greater variation
than in the above mentioned methods.

The method of the invention is also very well suited for
adaption to automatisation.

In a preferred embodiment the technique is used in
combination with an error-prone polymerase thereby introducing an
even greater variation in the library created.

More specifically the present invention relates to a method
for the construction of a library of recombined homologous
polynucleotides from a number of different starting single or
double stranded parental DNA templates and primers by induced
template shifts during an in vitro polynucleotide synthesis using
a polymerase, whereby

A. extended primers or polynucleotides are synthesized by

a) denaturing parental double stranded DNA templates to
produce single stranded templates,

b) annealing said primers to the single stranded DNA
templates,

c) extending said primers by initiating synthesis by use of
said polymerase,

d) cause arrest of the synthesis, and

e) denaturing the double strand to separate the extended
primers from the templates,

B. a template shift is induced by

a) isolating the newly synthesized single stranded extended
primers from the templates and repeating steps A.b) to
A.e) using said extended primers produced in (A) as both
primers and templates, or

b) repeating steps A.b) to A.e),

C. the above process is terminated after an appropriate number
of cycles of process steps A. and B.a), A. and B.b), or
combinations thereof, and

D. optionally the produced polynucleotides are amplified in a
standard PCR reaction with specific primers to selectively
amplify homologous polynucleotides of interest.



In specific embodiments various modifications can be made in
the process of the invention. For example it is advantageous
to apply a defective polymerase, either an error-prone polymerase
to introduce mutations in comparison to the templates, or a
polymerase that will discontinue the polynucleotide synthesis
prematurely to effect the arrest of the reaction.

Further modifications will be described below.

In a further aspect the invention relates to a method of
identifying polypeptides exhibiting improved properties in
comparison to naturally occurring polypeptides of the same
bioactivity, whereby a library of recombined homologous
polynucleotides produced by the above process are cloned into an
appropriate vector, said vector is then transformed into a
suitable host system, to be expressed into the corresponding
polypeptides and displayed, said polypeptides are then screened
in a suitable assay, and positive results selected.

In a still further aspect the invention relates to a method
for producing a polypeptide of interest as identified in the
preceding process, whereby a vector comprising a polynucleotide
encoding said polypeptide is transformed into a suitable host,
said host is grown to express said polypeptide, and the
polypeptide recovered and purified.

Finally, in further final aspects the invention
relates to a recombined/shuffled protein, which is obtainable by
any of the methods according to the invention, and which is a
recombined/shuffled protein comprising the sequences disclosed
herein (vide infra).

In those final aspects of the invention, the term
"obtainable" denotes that said protein is preferably obtained by
a method according to the invention. However a prior art known
recombination/shuffling technique such as those described in WO
95/22625 or WO 95/17413 may be used too, either alone or in
combination with a method according to the invention, in order to
obtain said recombined protein.

Accordingly, further final aspects of the invention are:

a recombined/shuffled protease obtainable by any of the
methods according to the invention, and comprising a recombined
sequence, which contains at least two different partial sequences
from at least two different wild-type proteases;

a recombined/shuffled lipase obtainable by any of the
methods according to the invention, and comprising a recombined
sequence, which contains at least two different partial wild-type
sequences;

a recombined/shuffled Pseudomonas lipase obtainable by any
of the methods according to the invention, and comprising a
recombined sequence, which contains at least two different partial
sequences from at least two of the different wild-type
Pseudomonas lipases;

a recombined/shuffled xylanase obtainable by any of the
methods according to the invention, and comprising a recombined
sequence, which contains at least two different partial sequences
from at least two different wild-type xylanases;

a recombined/shuffled cellulase obtainable by any of the
methods according to the invention, and comprising a recombined
sequence, which contains at least two different partial sequences
from at least two different wild-type cellulases;

a recombined/shuffled amylase obtainable by any of the
methods according to the invention, and comprising a recombined
sequence, which contains at least two different partial sequences
from at least two different wild-type amylases;

a recombined/shuffled laccase obtainable by any of the
methods according to the invention, and comprising a recombined
sequence, which contains at least two different partial sequences
from at least two different wild-type laccases;

a recombined/shuffled phytase obtainable by any of the
methods according to the invention, and comprising a recombined
sequence, which contains at least two different partial sequences
from at least two different wild-type phytases.

DEFINITIONS

Prior to discussing this invention in further detail, the
following terms will first be defined.

"Shuffling": The term "shuffling" means recombination of 
nucleotide sequence fragment(s) between two or more homologous
polynucleotides resulting in output polynucleotides (i.e.
polynucleotides having been subjected to a shuffling cycle)
having a number of nucleotide fragments exchanged, in comparison
to the input polynucleotides (i.e. starting point homologous
polynucleotides).

"Homology of DNA sequences or polynucleotides": In the
present context the degree of DNA sequence homology is determined
as the degree of identity between two sequences indicating a
derivation of the first sequence from the second. The homology
may suitably be determined by means of computer programs known in
the art, such as GAP provided in the GCG program package (Program
Manual for the Wisconsin Package, Version 8, August 1994,
Genetics Computer Group, 575 Science Drive, Madison, Wisconsin,
USA) (Needleman, S.B. and Wunsch, C.D., (1970), Journal of
Molecular Biology, 48, 443-453).

"Homologous": The term "homologous" means that one
single-stranded nucleic acid sequence may hybridize to a
complementary single-stranded nucleic acid sequence. The degree
of hybridization may depend on a number of factors including the
amount of identity between the sequences and the hybridization
conditions such as temperature and salt concentration as
discussed later (vide infra).



Using the computer program GAP (vide supra) with the
following settings for DNA sequence comparison: GAP creation
penalty of 5.0 and GAP extension penalty of 0.3, it is in the
present context believed that two DNA sequences will be able to
hybridize (using low stringency hybridization conditions as
defined below) if they mutually exhibit a degree of identity
preferably of at least 70%, more preferably at least 80%, and
even more preferably at least 85%.

"heterologous": If two or more DNA sequences mutually
exhibit a degree of identity which is less than specified above,
they are in the present context said to be "heterologous".
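
As a rough illustration of the identity thresholds above, the
following Python sketch computes percent identity over two
sequences that have already been aligned to equal length and
labels the pair accordingly. It is not the GAP program: GAP
performs the alignment itself using the gap penalties quoted
above, whereas this sketch assumes the alignment is given. The
function names and the default 70% cut-off are illustrative
assumptions.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over two pre-aligned, equal-length sequences.

    Gaps are written as '-'. Columns where both sequences carry a gap
    are ignored; a gap against a base counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(a, b) for a, b in zip(aligned_a.upper(), aligned_b.upper())
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no comparable columns")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def classify_pair(aligned_a: str, aligned_b: str,
                  threshold: float = 70.0) -> str:
    """Label a pair 'homologous' at or above the threshold (an assumed
    cut-off taken from the 70% figure in the text), else 'heterologous'."""
    identity = percent_identity(aligned_a, aligned_b)
    return "homologous" if identity >= threshold else "heterologous"
```

For example, two aligned 10-mers that differ in two positions give
80% identity and would be labelled homologous under the assumed
70% cut-off.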

"Hybridization": Suitable experimental conditions for
determining whether two or more DNA sequences of interest
hybridize are herein defined as hybridization at low stringency
as described in detail below.

Molecules to which the oligonucleotide probe hybridizes
under these conditions are detected using an x-ray film or a
phosphoimager.

"Primer": The term "primer" used herein especially in
connection with a PCR reaction is an oligonucleotide (especially
a "PCR-primer") defined and constructed according to general
standard specifications known in the art ("PCR A practical
approach" IRL Press, (1991)).

"A primer directed to a sequence": The term "a primer
directed to a sequence" means that the primer (preferably to be
used in a PCR reaction) is constructed to exhibit at least 60%
degree of sequence identity to the sequence fragment of interest,
more preferably at least [illegible]% degree of sequence identity
to the sequence fragment of interest, which said primer
consequently is "directed to". The primer is designed to
specifically anneal at the sequence fragment or region it is
directed towards at a given temperature. Especially identity at
the 3' end of the primer is essential.
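
The two requirements in this definition (a minimum overall
identity and a matching 3' end) can be sketched as a simple check.
This is an illustrative simplification, not a primer design tool:
it compares the primer with a target site of the same length
written in the same 5'-to-3' sense, ignores annealing temperature,
and the `three_prime_len` parameter (how many 3' bases must match
exactly) is an assumption introduced here.

```python
def is_directed_to(primer: str, target_site: str,
                   min_identity: float = 60.0,
                   three_prime_len: int = 3) -> bool:
    """Return True if the primer meets the identity threshold and its
    last `three_prime_len` bases (the 3' end) match the target exactly.

    Both sequences are written 5'->3' in the same sense and must have
    equal length; `three_prime_len` is a hypothetical parameter.
    """
    primer, target_site = primer.upper(), target_site.upper()
    if not primer or len(primer) != len(target_site):
        raise ValueError("primer and target site must be equal, non-zero length")
    matches = sum(p == t for p, t in zip(primer, target_site))
    identity = 100.0 * matches / len(primer)
    three_prime_ok = primer[-three_prime_len:] == target_site[-three_prime_len:]
    return identity >= min_identity and three_prime_ok
```

A primer with 90% identity but a mismatch in its final base is
rejected by this check, which reflects the emphasis the definition
places on the 3' end.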

"Flanking" The term "flanking" used herein in connection 
with DNA sequences comprised in a PCR-fragment means the
outermost partial sequences of the PCR-fragment, both in the 5'
and 3' ends of the PCR fragment.

"Polypeptide" Polymers of amino acids sometimes referred 
to as proteins. The sequence of amino acids determines the
folded conformation that the polypeptide assumes, and this in
turn determines biological properties and activity. Some
polypeptides consist of a single polypeptide chain (monomeric),
whereas others comprise several associated polypeptides
(multimeric). All enzymes and antibodies are polypeptides.

"Enzyme": A protein capable of catalysing chemical
reactions. Specific types of enzymes to be mentioned are such as
amylases, proteases, carbohydrases, lipases, cellulases,
oxidoreductases, esterases, etc. Of specific interest in relation
to the present invention are enzymes used in detergents, such as
proteases, lipases, cellulases, amylases, etc.



DETAILED DESCRIPTION OF THE INVENTION

The present invention relates in its first aspect to a
method for the construction of a library of recombined homologous
polynucleotides from a number of different starting single or
double stranded parental DNA templates and primers by induced
template shifts during an in vitro polynucleotide synthesis using
a polymerase, whereby

A. extended primers or polynucleotides are synthesized by

a) denaturing a parental double stranded DNA template to
produce single stranded templates,

b) annealing said primers to the single stranded DNA
templates,

c) extending said primers by initiating synthesis by use of
said polymerase,

d) cause arrest of the synthesis, and

e) denaturing the double strand to separate the extended
primers from the templates,

B. a template shift is induced by

a) isolating the newly synthesized single stranded extended
primers from the templates and repeating steps A.b) to
A.e) using said extended primers produced in (A) as both
primers and templates, or

b) repeating steps A.b) to A.e),

C. the above process is terminated after an appropriate number
of cycles of process steps A. and B.a), A. and B.b), or
combinations thereof, and

D. optionally the produced polynucleotides are amplified in a
standard PCR reaction with specific primers to selectively
amplify homologous polynucleotides of interest.
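
The repeated cycle of steps A.b) to A.e) can be pictured with a
small simulation. The sketch below is a deliberately simplified
model and not the laboratory procedure: it assumes that all
templates have the same length and are already aligned position
by position, that a growing strand re-anneals to a randomly chosen
template in each cycle, and that synthesis is arrested after a few
nucleotides. The function name, the `max_extension` parameter and
the use of a seeded random generator are illustrative assumptions.

```python
import random


def shuffle_by_template_shift(templates, max_extension=5, seed=None):
    """Build one recombined strand by repeated short extensions.

    Each loop iteration mimics one cycle: the growing strand anneals to
    a randomly chosen template (the template shift), is extended by
    1..max_extension nucleotides copied from that template (arrested
    synthesis), and is then released again (denaturation).
    Returns the product and the number of cycles used.
    """
    if not templates:
        raise ValueError("at least one template is required")
    if max_extension < 1:
        raise ValueError("max_extension must be at least 1")
    length = len(templates[0])
    if any(len(t) != length for t in templates):
        raise ValueError("this sketch assumes equal-length, aligned templates")

    rng = random.Random(seed)
    product = []
    cycles = 0
    while len(product) < length:
        template = rng.choice(templates)        # template shift
        step = rng.randint(1, max_extension)    # extension before arrest
        start = len(product)
        product.extend(template[start:start + step])
        cycles += 1
    return "".join(product), cycles
```

Calling the function many times with different seeds yields a
collection of chimeric strands, each position of which is copied
from one of the parental templates; a smaller `max_extension`
gives more crossovers per strand in this model.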

According to the invention the polymerase may be a DNA or
RNA polymerase; specific polymerases to be mentioned are such as
DNA polymerases like T4 polymerase, T7 polymerase, E. coli DNA
polymerase I or the Klenow fragment of DNA polymerase I, or
thermostable polymerases such as Taq, Amplitaq, Vent, Pwo.

One of the advantages of the invention is that it makes it
possible to control the length of the extension of the primers in
the reaction in a convenient manner.

This can be accomplished by various means such as choice of
polymerase, the physical and chemical conditions during the
action of the polymerase, e.g. pH, temperature, buffer, salt
concentration, and addition of various chemicals.

It is known that various polymerases carry out the DNA
synthesis at different rates (nucleotides incorporated per
second). For example, the Klenow fragment of DNA polymerase I has
a limited extension rate compared to e.g. the Taq polymerase
(Sambrook et al. 1989).

Polymerases also display differences in processivity, which
is the average number of nucleotides incorporated before the
polymerase dissociates from the template/extended primer; again
the Klenow polymerase is an example of a polymerase with limited
processivity.

The choice of polymerase is therefore an important means in
controlling the average extension of the primers.

These conditions may also exert an influence on the fidelity
of the polymerase (the rate by which point mutations are
introduced; HIV reverse transcriptase is an example of a
polymerase of low fidelity), a parameter useful in combining
shuffling and mutagenesis.

In specific embodiments various modifications can be made
in the process of the invention. For example it is advantageous
to apply a defective polymerase, either an error-prone polymerase
to introduce mutations in comparison to the templates, or a
polymerase that will discontinue the polynucleotide synthesis
prematurely to effect the arrest of the reaction. One such
defective polymerase that could be mentioned is a Klenow
polymerase having low processivity.

In another embodiment of the invention polymerase will be
added after each cycle, if the polymerase used is not
thermostable.

According to the invention the starting single or double
stranded parental templates may be different in that they contain
different point mutations in the same native polynucleotide
(gene), or they can be homologous polynucleotides (genes)
isolated from nature, which may be amplified by PCR, or they can
be combinations thereof. The templates used in the process of the
invention are hereby homologous showing an identity at the DNA
level of e.g. more than 95%, 90%, 85%, 80%, 75%, 70%, 65%, 60%,
55%, or even more than 50% identity.

It may be advantageous to use pre-selected templates
comprising mutations with improved properties of interest. The
present recombination method of the invention will then recombine
said improved mutations for subsequent screening for even further
improvements in the properties of interest.

Said pre-selected templates with improved properties of
interest may have been identified by standard procedures in the
art comprising e.g. i) error-prone PCR of templates of interest
followed by ii) screening/selection for templates with improved
characteristic of interest. The mutagenesis frequency (low or
high mutagenesis frequency) of the error-prone PCR step is
preferably adjusted in relation to the subsequent screening
capacity, i.e. if the screening capacity is limited the
error-prone PCR frequency is preferably low (i.e. one to two
mutations in each template) (see WO 92/18645 for further details).

The arrest of the polymerase reaction in step A.d) may as
indicated above be obtained in different ways, such as by raising
the temperature, or adding specific reagents as described in WO
95/17413.

When raising the temperature for this purpose, it is
preferred to use temperatures between 90°C and 99°C.

When using chemical agents DMSO is a possibility.
Appropriate procedures are mentioned in e.g. WO 95/17413.

The process of the invention uses annealing of the primers
to the templates in step A.b). In this context the annealing may
be random or specific, meaning either anywhere on the
polynucleotide or at a specific position depending on the nature
of the primer.

Also, the primers to be used may be completely random
primers (NNNNNNNNNNN) (N meaning that a mixture of the four bases
(A, T, G, C) is used at a particular position in the primer
during synthesis), semi-random primers, or specific primers.
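
The notation above, where N stands for any of the four bases, can
be illustrated by drawing individual primer sequences from a
pattern. The following Python sketch is only an illustration of
the notation: in practice a random primer is a mixed population
made during oligonucleotide synthesis, whereas the code samples
one member of that population at a time. The function name and
the example patterns are assumptions.

```python
import random

BASES = "ATGC"


def random_primer(pattern, rng=None):
    """Draw one primer sequence from a pattern of A, T, G, C and N.

    Each N is replaced by a base chosen uniformly from A, T, G, C;
    fixed bases are kept. 'NNNNNNNNNNN' gives a completely random
    11-mer, while a pattern such as 'ATGNNNNGCC' gives a semi-random
    primer with fixed ends.
    """
    if rng is None:
        rng = random.Random()
    pattern = pattern.upper()
    if any(ch not in BASES + "N" for ch in pattern):
        raise ValueError("pattern may contain only A, T, G, C and N")
    return "".join(rng.choice(BASES) if ch == "N" else ch for ch in pattern)
```

A specific primer corresponds to a pattern without any N, in
which case the function simply returns the pattern unchanged.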

If the extended primers produced are to be separated from
the primers during the process it is convenient to use labeled
templates in order to provide a simple means for separation; a
preferred marker is biotin or digoxigenin.

According to the invention the number of cycles necessary
will be less than 500, in most cases less than 200, and normally
less than 100 cycles.

In an embodiment of the invention the above in vitro
shuffling is combined with a subsequent in vivo shuffling by
methods such as those described in WO 97/07205.

In its second aspect the invention relates to a method of
identifying polypeptides exhibiting improved properties in
comparison to naturally occurring polypeptides of the same
bioactivity, whereby a library of recombined homologous
polynucleotides produced by the above process are cloned into an
appropriate vector, said vector is then transformed into a
suitable host system, to be expressed into the corresponding
polypeptides; said polypeptides are then screened in a suitable
assay, and positive polypeptides selected.

In an embodiment it is contemplated that the polypeptides
of interest encoded by the shuffled library are expressed as a
suitable fusion protein (e.g. as a hybrid with gIII of
bacteriophage M13/fd) in order to display said recombined
polypeptide on the surface of phage or bacteria.

In a third aspect the invention relates to a method for
producing a polypeptide of interest as identified in the
preceding process, whereby a vector comprising a polynucleotide
encoding said polypeptide is transformed into a suitable host,
said host is grown to express said polypeptide, and the
polypeptide recovered and purified.

The use of partially random (semi-random) or completely
random primers (mixtures of bases in a selected number or all
positions in the primer) as initiation point for the DNA
synthesis provides certain novel possibilities for the combined
use of shuffling and random mutagenesis.

It is often difficult to obtain an in vitro recombination of
polynucleotides that display relatively limited homology. By the
use of an embodiment of the invention even very diverse
polynucleotides can be forced to recombine.

According to that embodiment at least two templates (or two
pools of diverse templates) are applied. The novel synthesis of
the one polynucleotide can then be based on only one strand (i.e.
either the sense or the anti-sense strand), and the synthesis of
the other polynucleotide is based on the opposite strand.

This can be accomplished by isolating the complementary
strands from the two templates, e.g. by having these strands
labeled by biotin. Synthesis of DNA is initiated by annealing
either specific, partly random or completely random primers to
these templates and adding a suitable polymerase. This can be
performed as either separate reactions for the different
templates or in just one reaction. Synthesis should preferably be
performed under conditions that favor production of relatively
short new fragments. These fragments can subsequently be isolated
from the templates based on the affinity label. A PCR reaction is
carried out on these fragments and as the starting material
originates from two different strands, the newly synthesized
fragments must recombine in order to produce full length PCR
products - a kind of forced recombination.

Also for this embodiment rapid PCR with short or no
extension time can be applied advantageously in order to enhance
recombination, especially if pools of templates are used for the
two strands.

The length of the primer and the annealing temperature
utilized in the process determine whether random primers will
anneal and the number of mismatches between the template and the
primer that can be accommodated. By varying the primer length and
the annealing temperature the method of the invention provides a
means for achieving random mutagenesis within a certain
nucleotide window representing the length of the primer. The
method of the invention thereby provides substantial benefits
compared to other random mutagenesis approaches, especially the
high probability for several base substitutions close to each
other in the primary sequence, e.g. the use of a completely
random 20'mer (mixture of all four nucleotides in all 20
positions) will according to theory under given experimental
conditions give a certain reasonably high probability for having
several base substitutions close to each other.

Error prone PCR (= high mutagenesis frequency PCR) does not
provide this possibility. Error prone PCR provides a very low
probability for having more than one base substitution within one
codon (coding for one amino acid in a translated polypeptide).

Obviously the substitution of only one base within a codon
does not provide total random mutagenesis (at protein level) as
only a limited set of amino acid substitutions can be obtained by
one base substitution at DNA level (e.g. Methionine encoded by
the ATG-codon requires three base substitutions to become the TGT
or TGC-codon encoding Cysteine).
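
The Methionine-to-Cysteine example can be checked directly against
the standard genetic code. The sketch below builds the standard
codon table, counts the base differences between ATG and the two
Cysteine codons, and lists the amino acids that are reachable from
ATG by a single base substitution. The function names are
illustrative; the codon table is the standard one written in the
usual T, C, A, G order.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def base_differences(codon_a, codon_b):
    """Number of positions at which two codons differ."""
    if len(codon_a) != 3 or len(codon_b) != 3:
        raise ValueError("codons must have three bases")
    return sum(x != y for x, y in zip(codon_a, codon_b))


def single_substitution_amino_acids(codon):
    """Amino acids (one-letter codes, '*' = stop) reachable from `codon`
    by changing exactly one base, excluding the original amino acid."""
    original = CODON_TABLE[codon]
    reachable = set()
    for position in range(3):
        for base in BASES:
            if base == codon[position]:
                continue
            mutant = codon[:position] + base + codon[position + 1:]
            if CODON_TABLE[mutant] != original:
                reachable.add(CODON_TABLE[mutant])
    return reachable
```

For ATG, `base_differences` gives 3 against both TGT and TGC, and
the single-substitution set contains only L, V, K, T, R and I, so
Cysteine cannot be reached by one base change.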

In one embodiment of the invention the process is therefore
performed by using random or semi-random primers having a length
of from 6 to 200 bp, preferably from 10 to 70 bp, and better from
15 to 40 bp.

One of the advantages in the method of the invention is the
robustness. In some embodiments the constant presence of full
length template provides a further advantage, avoiding PCR
contamination problems. Furthermore it is much less laborious,
with less hands-on time, than other described methods, thereby
providing excellent possibilities for automation.

PCR primers:

The PCR primers are constructed according to the standard
descriptions in the art. Normally they are 10-75 base-pairs (bp)
long. However, for the specific embodiment using random or
semi-random primers the length may be substantially longer as
indicated above.

2 5 PCR-reactions : 

If not otherwise mcnticr.ec the PCR-reaction performed ac- 
cording to the invention are ceriormed according to standard pro- 
tocols known in the art. 

The term ''Isolation of : fragment" is intended to cover 
30 as broad as simply an aliqc:--. claming the PGR fragment. How- 
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ever preferably the PGR fragment is isolated to an extend which 
remove surplus of primers^ nucleotides templates etc. . 

In an embodiment of the invention the DNA fragment(s)
is (are) prepared under conditions resulting in a low, medium or
high random mutagenesis frequency.

To obtain low mutagenesis frequency the DNA sequence(s)
(comprising the DNA fragment(s)) may be prepared by a standard
PCR amplification method (US 4,683,202 or Saiki et al., (1988),
Science 239, 487-491).

A medium or high mutagenesis frequency may be obtained by
performing the PCR amplification under conditions which increase
the misincorporation of nucleotides, for instance as described by
Deshler, (1992), GATA 9(4), 103-106; Leung et al., (1989),
Technique, Vol. 1, No. 1, 11-15.

It is also contemplated according to the invention to
combine the PCR amplification (i.e. according to this embodiment
also DNA fragment mutation) with a mutagenesis step using a
suitable physical or chemical mutagenizing agent, e.g., one which
induces transitions, transversions, inversions, scrambling,
deletions, and/or insertions.

Expressing the recombinant protein from the recombinant shuffled
sequences

Expression of the recombinant protein encoded by the
shuffled sequence in accordance with the second and third aspect
of the present invention may be performed by use of standard
expression vectors and corresponding expression systems known in
the art.

Screening and selection

In the context of the present invention the term "positive
polypeptide variants" means resulting polypeptide variants
possessing functional properties which have been improved in
comparison to the polypeptides producible from the corresponding
input DNA sequences. Examples of such improved properties can be
as different as e.g. enhanced or lowered biological activity,
increased wash performance, thermostability, oxidation stability,
substrate specificity, antibiotic resistance etc.

Consequently, the screening method to be used for
identifying positive variants depends on which property of the
polypeptide in question it is desired to change, and in what
direction the change is desired.

A number of suitable screening or selection systems to
screen or select for a desired biological activity are described
in the art. Examples are:

Strauberg et al. (Biotechnology 13: 669-673 (1995))
describes a screening system for subtilisin variants having
calcium-independent stability;

Bryan et al. (Proteins 1:326-334 (1986)) describes a
screening assay for proteases having enhanced thermal stability;
and

PCT-DK95/00322 describes a screening assay for lipases
having improved wash performance in washing detergents.

An embodiment of the invention comprises screening or
selection of recombinant protein(s), wherein the desired
biological activity is performance in dish-wash or laundry
detergents. Examples of suitable dish-wash or laundry detergents
are disclosed in PCT-DK95/00322 and WO 95/30311.

If, for instance, the polypeptide in question is an enzyme
and the desired improved functional property is the wash
performance, the screening may conveniently be performed by use
of a filter assay based on the following principle:

The recombination host cell is incubated on a suitable
medium and under suitable conditions for the enzyme to be
secreted, the medium being provided with a double filter
comprising a first protein-binding filter and on top of that a
second filter exhibiting a low protein binding capability. The
recombination host cell is located on the second filter.
Subsequent to the incubation, the first filter comprising the
enzyme secreted from the recombination host cell is separated
from the second filter comprising said cells. The first filter is
subjected to screening for the desired enzymatic activity and the
corresponding microbial colonies present on the second filter are
identified.

The filter used for binding the enzymatic activity may be
any protein binding filter e.g. nylon or nitrocellulose. The
top-filter carrying the colonies of the expression organism may
be any filter that has no or low affinity for binding proteins
e.g. cellulose acetate or Durapore™. The filter may be
pre-treated with any of the conditions to be used for screening
or may be treated during the detection of enzymatic activity.

The enzymatic activity may be detected by a dye,
fluorescence, precipitation, pH indicator, IR-absorbance or any
other known technique for detection of enzymatic activity.

The detecting compound may be immobilized by any
immobilizing agent e.g. agarose, agar, gelatin, polyacrylamide,
starch, filter paper, cloth; or any combination of immobilizing
agents.

If the improved functional property of the polypeptide is
not sufficiently good after one cycle of shuffling, the
polypeptide may be subjected to another cycle.

In an embodiment of the invention wherein homologous
polynucleotides representing a number of mutations of the same
gene are used as templates, at least one shuffling cycle is a
backcrossing cycle with the initially used DNA fragment, which
may be the wild-type DNA fragment. This eliminates non-essential
mutations. Non-essential mutations may also be eliminated by
using wild-type DNA fragments as the initially used input DNA
material.

Also contemplated to be within the invention are
polypeptides having biological activity such as insulin, ACTH,
glucagon, somatostatin, somatotropin, thymosin, parathyroid
hormone, pituitary hormones, somatomedin, erythropoietin,
luteinizing hormone, chorionic gonadotropin, hypothalamic
releasing factors, antidiuretic hormones, thyroid stimulating
hormone, relaxin, interferon, thrombopoietin (TPO) and prolactin.

A requirement to the starting parental DNA sequences,
encoding the polypeptide(s), to be shuffled, is that they are at
least 50%, 60%, 70%, 80%, 90%, or 95% homologous. DNA sequences
being less homologous will have less inclination to interact and
recombine.
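
One way to check a candidate pair of parental sequences against such a threshold is sketched below. It assumes that "homologous" is measured as percent identity over an existing pairwise alignment (the claims speak of identity at the DNA level); the function name and the gap convention (`-`) are illustrative assumptions.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over the gap-free columns of two aligned sequences.

    Assumption: both strings come from the same pairwise alignment, so
    they have equal length and use '-' for gaps.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # columns containing a gap are not counted
        compared += 1
        matches += a == b
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


if __name__ == "__main__":
    identity = percent_identity("ATGCATGCAT", "ATGCATGGAT")
    print(f"{identity:.1f}% identical; meets 50% threshold: {identity >= 50}")
```

Other definitions (for example counting gap columns as mismatches) give different numbers, so the convention should be fixed before applying a cut-off.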

It is also contemplated according to the invention to
shuffle parental polynucleotides that are homologous as indicated
above originating from wild type organisms of different genera.

Further, the starting parental templates to be shuffled may
preferably have a length of from about 50 bp to 20 kb, preferably
about 100 bp to 10 kb, more preferred about 200 bp to 7 kb,
especially about 400 bp to 2 kb.

The starting parental DNA sequences may be any DNA
sequences including wild-type DNA sequences, DNA sequences
encoding variants or mutants, or modifications thereof, such as
extended or elongated DNA sequences, and may also be the outcome
of DNA sequences having been subjected to one or more cycles of
shuffling (i.e. output DNA sequences) according to the method of
the invention or any other method (e.g. any of the methods
described in the prior art section).

When using the method of the invention the resulting
recombined homologous polynucleotides (i.e. shuffled DNA
sequences) have had a number of nucleotide fragments exchanged.
This results in replacement of at least one amino acid within the
polypeptide variant, if comparing it with the parent polypeptide.
It is to be understood that also silent exchanges are
contemplated (i.e. nucleotide exchange which does not result in
changes in the amino acid sequence).

MATERIALS AND METHODS

Specific Method Used in the Examples:

A) DNA encoding different enzyme variants of the same gene or
different enzymes having the same type of activity encoded by
homologous genes are mixed. The DNA is provided as either PCR
fragments, plasmid, phage or genomic DNA.

B) The resulting pool of DNA is mixed with DNA Polymerase,
dNTP, a suitable buffer and primers (being either random
oligomers (length of 6-30 nucleotides) or specific oligomers
(length of 6-50 nucleotides) or a combination of both types).

C) The PCR mixture is put into a PCR thermocycler (either
cold or hot) in a suitable tube.

c1) The thermocycler is heated to a temperature of
90-100°C for a period of time (typically 1-10 min) in order to
denature the DNA templates.

c2) Thereafter the following procedure (cycle) is followed
(repeated): The template is denatured (typically 90-100°C for
0-5 minutes). Then the temperature is lowered (typically to a
value between 10°C and 90°C for 0-5 minutes) to allow annealing
of the primer to the single stranded template. Now the
temperature is raised again to denaturation temperature
(90-100°C), allowing small extensions of the primer to be
synthesized by the DNA polymerase during ramping. Alternatively a
short extension period (typically 0-30 seconds at 70-75°C) can be
introduced to allow larger extensions of the primers to be
generated. When the temperature reaches a value where
denaturation takes place, the extended primers and templates are
again separated. This procedure can be repeated (typically
between 1 to 99 cycles).

D) Having performed the desired number of cycles the
generated small DNA polymers can be purified from the oligomers
used as primers. One way is to isolate and clone a specific
amplified band containing the gene coding for the polypeptide of
interest into a suitable vector. This can be done either on
agarose gel (typically isolating fragments between 50 to 1000
base pairs), by beads (using an affinity label on either
templates or primers) or through columns.

E) Then the purified (or the not purified) DNA polymers can
be assembled in a standard PCR reaction (for instance 94°C, …
minutes, (94°C, 30 sec; 55°C, 30 sec; 72°C, 2 min) × 25, 72°C 5
minutes, 4°C).
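
The effect of the repeated short extensions in step c2 can be illustrated with a toy in-silico model. This is only a sketch under strong simplifying assumptions that are not in the original text: the parents are pre-aligned and of equal length, a growing strand re-anneals at the same coordinate on a randomly chosen template in every cycle, and the extension length per cycle is drawn uniformly. The names `shuffle_by_template_switching` and `max_extension` are hypothetical.

```python
import random


def shuffle_by_template_switching(parents, max_extension=30, rng=None):
    """Build one chimeric sequence by repeated short extensions.

    Each loop iteration mimics one cycle: the growing strand anneals to
    a randomly chosen parent, is extended by a few bases, and synthesis
    is then arrested by the next denaturation.
    Returns the chimera and, per position, the index of the parent used.
    """
    if rng is None:
        rng = random.Random()
    length = len(parents[0])
    if any(len(p) != length for p in parents):
        raise ValueError("toy model assumes aligned parents of equal length")

    pieces = []
    origins = []
    pos = 0
    while pos < length:
        template = rng.randrange(len(parents))  # template shift
        step = rng.randint(1, max_extension)    # short extension
        end = min(pos + step, length)
        pieces.append(parents[template][pos:end])
        origins.extend([template] * (end - pos))
        pos = end
    return "".join(pieces), origins


if __name__ == "__main__":
    demo_parents = ["A" * 60, "C" * 60, "G" * 60]  # artificial parents
    chimera, origins = shuffle_by_template_switching(
        demo_parents, max_extension=10, rng=random.Random(1)
    )
    print(chimera)
```

The artificial single-letter parents make the crossover points visible in the printed chimera; real parents would differ only at their mutation sites.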

Specific primers or DNA polymers generated by specific
primers can be added in order to generate a specific DNA polymer
containing the gene of interest. As mentioned in point D, this
can be purified and cloned into a vector of interest.

Method 1:

The strong advantage of the method exemplified here is the
robustness and lack of PCR contamination problems, due to the
constant presence of parental template. Furthermore this method
is less labor demanding than methods described in the prior art,
thereby providing excellent possibilities for automation.

Nine different plasmids containing DNA sequences encoding
9 different variants of the H. lanuginosa lipase gene were
mixed in equimolar amounts. The variant genes contained from two
to seven mutations scattered throughout the gene.

The DNA sequence of the H. lanuginosa lipase gene and the
amino acid sequence of the lipase are disclosed in EP 0 305 216.

The variants are indicated according to standard
terminology as described in e.g. EP 0 396 608, and WO 97/07202.



The following 9 variant genes were shuffled:

N94K+D96L+E99K
STPRRP+N94R
SCIRR+N94K+D96L+E2…
D96L+E99K+V187A
SPPRRP+D57G+N94K+D96L…
…+Q249R
N94R+F95L

The following components were mixed in a microtube:
2 µl plasmid mixture (… µg/µl), specific primers flanking the
gene (1 pmol/µl), 2.5 mM dNTP, 2.5 mM MgCl2, 2 µl 10× taq buffer
(Perkin Elmer), 0.5 µl taq enzyme in a total volume of 20 µl.

The tube was set in a Perkin Elmer 2400 thermocycler. The
following PCR-program was run: (94°C, 5 minutes) 1 cycle;
(94°C, 30 seconds; 70°C, 0 seconds) 99 cycles; (72°C, 2 minutes;
4°C indefinite) 1 cycle.
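
For automation it can help to write such a program down as data. The sketch below does this for the Method 1 program; the `Stage` class and function name are illustrative, the annealing temperature is read as 70°C from a partly garbled source line (an assumption), and the final 4°C hold is left out because it has no fixed duration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    """A block of (temperature in °C, hold time in seconds) steps."""
    steps: tuple
    cycles: int = 1


# Method 1 program as read from the text; 70 °C is an assumed reading.
METHOD_1_PROGRAM = (
    Stage(steps=((94, 300),)),                    # initial denaturation
    Stage(steps=((94, 30), (70, 0)), cycles=99),  # denature / 0 s anneal
    Stage(steps=((72, 120),)),                    # final extension
)


def programmed_hold_seconds(program):
    """Sum of programmed hold times; ramping time is not included."""
    return sum(
        stage.cycles * sum(hold for _temp, hold in stage.steps)
        for stage in program
    )


if __name__ == "__main__":
    total = programmed_hold_seconds(METHOD_1_PROGRAM)
    print(f"{total} s programmed hold time ({total / 60:.1f} min)")
```

Because the annealing step has a hold time of 0 seconds, primer extension in this program happens only while the block ramps between temperatures, so the real run time is longer than the programmed hold time.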

The PCR-reaction was run on a 1.5 % agarose gel. A DNA-band
of the specific expected size was cut out of the agarose gel and
purified using JETsorb (from GENOMED Inc.). The purified
PCR-product was cloned into a TA-vector (from Invitrogen, the
original TA cloning kit). The ligated product was transformed
into a standard Escherichia coli strain (DH5α).

20 transformants were fully sequenced across the gene of
interest.



Result:

The following 20 variants were found:

1. D137G+D167G+E210V+Y213…
2. SPPRRP+D57G+N94K+D96L+…
3. N94R+F95L
4. SPPRRP+D137G+D167G+E21…
5. N94K+D96L+E99K+V187A+T…
6. D137G+D16…
7. N94K+D96L+E99K+V187A
8. D57G+N94R+F95L+Q249R
9. N94K+D96L+E99K+E210V
10. SPPRRP+A19C+C36…
11. N94R+F95L
12. D137G+D167G+E210V
13. N94K+D96L+Q249R
14. SPPRRP+Q15P+A19C+C36A+N94K+D96L
15. SPPRRP+N94K+D96L+T231R+N233R+D234R+Q249R
16. D137G+D167G+E210V
17. SCIRR+N94K+D96L+Q249R
18. N94K+D96L+E99K
19. N94R+F95L
20. SPPRRP+N94R+F95L+F113S+Q249R

Nearly all mutations were represented (19 of 20), indicating
little bias for specific templates.
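
Tallies of this kind are easier to make once the variant names are split into their individual mutations. The following sketch parses the `+`-separated notation used above; the function name is illustrative, and tokens that are not simple point mutations (such as SPPRRP) are returned unchanged without interpreting them.

```python
import re

# One-letter wild-type residue, position, one-letter replacement,
# e.g. "N94K". Anything else is passed through untouched.
POINT_MUTATION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_variant(name):
    """Split a variant name into point mutations and other tokens."""
    point_mutations = []
    other_tokens = []
    for token in name.split("+"):
        token = token.strip()
        match = POINT_MUTATION.match(token)
        if match:
            wild_type, position, replacement = match.groups()
            point_mutations.append((wild_type, int(position), replacement))
        elif token:
            other_tokens.append(token)
    return point_mutations, other_tokens


if __name__ == "__main__":
    for variant in ("SCIRR+N94K+D96L+Q249R", "N94R+F95L"):
        print(variant, "->", parse_variant(variant))
```

Collecting the parsed mutations of all sequenced clones into a set and comparing it with the set of mutations in the input variants gives the kind of "represented" count quoted above.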



Statistics: 
Not shuffled 

Shuffled between at least 2 templates 
Shuffled between at least 3 templates 



The shuffled sequences can then be subcloned from the E. coli TA
vector into the yeast vector pJS026 as a BamHI-XbaI fragment
(see WO 97/07205), and e.g. screened for new shuffled sequences
with improved performance in detergents (see WO 97/07205).

Method 2:

PCR products of 10 different lipase variant genes were
generated as above and pooled in equimolar amounts.

The following 10 mutant genes were shuffled.

1'. D137G+D167G+E210V
2'. D96L+E99K+V187A
3'. N94K+D96L+E99K
4'. SPPRRP+D57G+N94K+D96L+Q249R
5'. D111N+F211A+G225P
6'. SPPRRP+N94K+D96L+T231R+N233R+D234R+Q249R
7'. SPPRRP+A19C+C36A+N94K+D96L+Q249R
8'. STPRRP+N94R
9'. N94R+F95L
10'. SCIRR+N94K+D96L+E239C+Q249R

The following mixture was generated in a suitable tube:

1 µl PCR mixture (0.1 µg), decamer random primer (300
pmol), 2 µl 10× Klenow buffer (Promega), 0.25 mM dNTP, 2.5 mM
MgCl2 in a total volume of 20 µl.

The mixture was set in a PE2400 thermocycler where the
following program was run: 95°C, 5 minutes, 25°C 5 minutes, … µl
Klenow enzyme was added, 25°C 60 minutes, 35°C 90 minutes.

This procedure generates a high number of small DNA
polymers originating from all parts of the gene.

10 µl was taken out for test on agarose gel.

10 µl PCR mixture (0.25 mM dNTP, 1 µl 10× Taq buffer
(Perkin Elmer), 2.5 mM MgCl2, 0.5 µl Taq enzyme) was added to
the 10 µl in the tube in the thermocycler. Then the following
standard PCR-program was run: (94°C, 5 minutes) 1 cycle, (94°C 30
seconds, 45°C, 30 seconds, 72°C 30 seconds) 25 cycles, 72°C 7
minutes, 4°C indefinite.

The PCR products were run on a 1.5% agarose gel. A clear
unbiased smear was seen. DNA between 400 and 800 bp was isolated
from the gel.

Half of the purified PCR product was mixed in a tube with
two specific primers (40 pmol) flanking the gene of interest,
0.25 mM dNTP, 2 µl 10× Taq buffer, 2.5 mM MgCl2. Then the
following standard PCR-program was run: (94°C, 5 minutes) 1 cycle,
(94°C 30 seconds, 50°C, 30 seconds, 72°C 30 seconds) 25 cycles,
72°C 7 minutes, 4°C indefinite.

The PCR product was run on a 1.5% agarose gel. A specific
though weak band of the expected size was isolated. Additional
PCR was run using specific primers (as mentioned above) in order
to amplify the PCR-product before cloning.

The PCR-product and the desired vector were cut with the
appropriate restriction enzymes (BamHI/XhoI). The vector and the
PCR product were run on a 1.5% agarose gel, and purified from
the gel.

The cut PCR-product and the cut vector were mixed in a
ligase buffer with T4 DNA ligase (Promega). After overnight
ligation at 16°C the mixture was transformed into E. coli strain
DH5α.

19 clones were fully sequenced across the gene.

Result:

The following 19 variants were found:

1. STPRRP+N94R+N233R+D234R+Q249R
2. SPPRRP+…+N233R+D234R+Q249R
3. D96L+D167G+Q249R
4. N94R
5. D167G
6. SPPRRP+A19C+A2?T+N94K+D96L+D111N+E239C
7. SPPRRP+A19C+C36A+N94K+D96L+Q249R
8. N94K+D96L
9. ?25T+D57G+N94R+E99K+D167G+T231R+N233R+D234R+Q249R
10. N94R+Q249R
11. D167G
12. D167G
13. T32I+N94R+F95L+D167G+Q249R
14. E87K+N94K+D96L
15. N94R+F95L+Q249R
16. N94K+D96L+D111N
17. STPRRP+S17T
18. N94K+D96L+V187A
19. SPPRRP+D57G+N94K+D96L+D111N+L151S



All template variants were represented, indicating little
bias for specific templates.

There were no apparent hot spots with regard to mutation
exchange, which seems to be evenly distributed along the gene.
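
A rough way to classify a sequenced clone by how many templates must have contributed to it is sketched below. It is not the authors' procedure: it only computes a lower bound, namely the smallest number of input variants whose mutations together cover the mutations the clone inherited. Mutations found in no template are treated as new and ignored, and template mutations that are absent from the clone are not considered. The template list is transcribed from the Method 2 list above as far as it is legible.

```python
from itertools import combinations

# Input variants of Method 2 as read from the text (assumed reading).
TEMPLATES = {
    "1'": {"D137G", "D167G", "E210V"},
    "2'": {"D96L", "E99K", "V187A"},
    "3'": {"N94K", "D96L", "E99K"},
    "4'": {"SPPRRP", "D57G", "N94K", "D96L", "Q249R"},
    "5'": {"D111N", "F211A", "G225P"},
    "6'": {"SPPRRP", "N94K", "D96L", "T231R", "N233R", "D234R", "Q249R"},
    "7'": {"SPPRRP", "A19C", "C36A", "N94K", "D96L", "Q249R"},
    "8'": {"STPRRP", "N94R"},
    "9'": {"N94R", "F95L"},
    "10'": {"SCIRR", "N94K", "D96L", "E239C", "Q249R"},
}


def min_templates(clone_mutations, templates):
    """Smallest number of templates covering the inherited mutations."""
    known = set().union(*templates.values())
    inherited = set(clone_mutations) & known  # drop de novo mutations
    if not inherited:
        return 0
    names = list(templates)
    for size in range(1, len(names) + 1):
        for combo in combinations(names, size):
            covered = set().union(*(templates[name] for name in combo))
            if inherited <= covered:
                return size
    raise AssertionError("unreachable: all templates together cover 'known'")


if __name__ == "__main__":
    clone_16 = {"N94K", "D96L", "D111N"}
    print(min_templates(clone_16, TEMPLATES))
```

The exhaustive search over template combinations is only practical for small template pools such as the ten variants used here.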



Statistics : 
Not shuffled 
Shuffled betw 
Shuffled be'i 
Shuffled bet 



sen ac least 2 
een a': least 3 
e e n at least 4 



Shuffled b e t w e e : 



plates 
plates 
plates 
plates 
olates 



I 

10 

6 

1 

0 



The shuffled sequences can then be subcloned from the
E. coli TA vector into the yeast vector pJS026 as a BamHI-XbaI
fragment (see WO 97/07205), and e.g. screened for new shuffled
sequences with improved performance in detergents (see WO
97/07205).

Example 3:

Amylase variant shuffling:

In Example 1, it was shown how a number of multiple
variants of H. lanuginosa lipase were shuffled. In a similar
manner variants of Bacillus α-amylases can be shuffled.

Earlier patent applications have identified variants of
various α-amylases from Bacillus species improved for particular
properties, e.g. thermostability, stability under
calcium-depleted conditions, improved wash-performance etc. (see
WO95/10603, WO96/23874, WO96/23873, and PCT/DK97/00197).

Variants of B. licheniformis α-amylase amyL can be
shuffled as follows. The variants are all located in the B.
subtilis expression vector pDN1528 described in WO95/10603.

The experiment is carried out under the exact same
conditions as Example 1 except that the flanking 27mer primers
used to initiate DNA synthesis were different.

The PCR amplified band of approximately 1500 bp is
purified from an agarose gel and cloned as described in Example 1.

Alternatively, restriction sites located within the amyL
gene can be utilized to clone the library of shuffled genes into
either the Bacillus plasmid pDN1528 or an E. coli vector
containing the wild type amyL gene, e.g. pJeEN1 described in
WO96/23874.

Shuffling of two genes encoding homologous α-amylases:
amyL and the amylase identified by SEQ.ID no. 2 (amino acid) and
SEQ.ID no. 5 (DNA) described in WO96/23873.



The forward strand (identical to the mRNA) of amyL can be
amplified in a PCR using standard conditions.

The forward strand is separated from the reverse strand
based on its affinity to streptavidin coated magnetic beads and
denaturation of the two strands with NaOH.

Similarly the reverse strand (complementary to mRNA) of the
amylase encoded by SEQ.ID no. 5 (WO96/23873) can be amplified and
isolated.

Two primer strands are used as templates in a PCR: (94°C, 5
minutes) + 99 cycles of (94°C, 30 seconds; 60°C, 0 seconds) +
(72°C, 5 minutes) using random primers of various lengths, Taq
polymerase and standard buffer conditions as described in
Example 1.

The resulting approximately 1500 bp product is cloned
either by TA cloning as described in Example 1 (for verification
of the sequence of the resulting clone) or into the Bacillus
vector pTV3110 utilizing SfiI and PstI restriction sites.

The original template can be removed from the PCR at any
step (e.g. after 5, 10 or 20 cycles) based on the biotin tag.



PATENT CLAIMS

1. A method for the construction of a library of recombined
homologous polynucleotides from a number of different starting
single or double stranded parental DNA templates and primers by
induced template shifts during an in vitro polynucleotide
synthesis using a polymerase, whereby

A. extended primers or polynucleotides are synthesized by

a) denaturing a parental double stranded DNA template to
produce single stranded templates,

b) annealing said primers to the single stranded DNA
templates,

c) extending said primers by initiating synthesis by use
of said polymerase,

d) causing arrest of the synthesis, and

e) denaturing the double strand to separate the extended
primers from the templates,

B. a template shift is induced by

a) isolating the newly synthesized single stranded
extended primers from the templates and repeating steps
A.b) to A.e) using said extended primers produced in
(A) as both primers and templates, or

b) repeating steps A.b) to A.e),

C. the above process is terminated after an appropriate
number of cycles of process steps A. and B.a), A. and B.b),
or combinations thereof, and

D. optionally the produced polynucleotides are amplified in a
standard PCR reaction with specific primers to selectively
amplify homologous polynucleotides of interest.

2. The method of claim 1, wherein said polymerase is a
thermostable DNA polymerase, such as Taq, Amplitaq, Vent, Pwo.

3. The method of claim 1, wherein said polymerase is a DNA
polymerase, such as T4 polymerase, T7 polymerase, E. coli DNA
polymerase I or the Klenow fragment of DNA polymerase I, and
said polymerase is added to the reaction mixture after each
cycle.

4. The method of claim 1, wherein said polymerase is an RNA
polymerase.

5. The method of any of the claims 1 to 4, wherein said
polymerase is an error-prone polymerase, such as a reverse
transcriptase like HIV reverse transcriptase.

6. The method of any of the claims 1 to 5, wherein said
starting single or double stranded parental templates are
different in that they contain different point mutations in the
same native polynucleotide (gene), or are homologous
polynucleotides (genes) isolated from nature.

7. The method of claim 5, wherein said starting single or
double stranded parental templates exhibit an identity at the
DNA level of e.g. more than 95%, 90%, 85%, 80%, 75%, 70%, 65%,
60%, 55%, or even more than 50% identity.

8. The method of any of the claims 1 to 7, wherein the
arrest of the polymerase reaction in step A.d) is achieved by
raising the temperature.

9. The method of claim 8, wherein the temperature is raised
to a value between 90°C and 99°C, preferably 94°C.

10. The method of any of the claims 1 to 7, wherein the
arrest of the polymerase reaction in step A.d) is achieved by
adding specific reagents, such as DMSO.

11. The method of any of the claims 1 to 10, wherein said
primers comprise a population of completely random primers.

12. The method of any of the claims 1 to 10, wherein said
primers comprise a population of semi-random primers.

13. The method of any of the claims 1 to 10, wherein said
primers comprise a population of specific primers.

14. The method of any of the claims 1 to 13, wherein said
primers are labeled, preferably with biotin or digoxigenin.

15. The method of any of the claims 11, 12 or 14, wherein
said random or semi-random primers have a length of from 6 to
500 bp, preferably from 15 to 300 bp, and better from 25 to 150
bp.

16. The method of any of the claims 13 or 14, wherein said
primers are 6 to 75 bp long.

17. The method of any of the claims 1 to 16, wherein said
starting parental DNA templates are at least 50%, 60%, 70%,
80%, 90%, or 95% homologous.

18. The method of claim 17, wherein said starting parental
DNA templates originate from wild type organisms of different
genera.

19. The method of any of the claims 17 or 18, wherein said
starting parental templates have a length of from about 50 bp
to 20 kb, preferably about 100 bp to 10 kb, more preferred
about 200 bp to 7 kb, especially about 400 bp to 2 kb.

20. The method of any of the claims 1 to 19, wherein said
starting parental templates are linear DNA fragments generated
by a PCR reaction.

21. The method of any of the claims 1 to 20, wherein said
starting parental templates are cloned into suitable vectors,
such as a plasmid.

22. The method of any of the claims 1 to 21, wherein said
starting parental templates represent a polynucleotide encoding
an enzyme, such as a carbonyl hydrolase, carbohydrase, or
esterase, such as a protease, lipase, amylase, cellulase,
oxidase, oxido reductase, etc.

23. The method of any of the claims 1 to 21, wherein said
starting parental templates represent a polynucleotide encoding
a polypeptide having biological activity, such as insulin,
ACTH, glucagon, somatostatin, somatotropin, thymosin,
parathyroid hormone, pituitary hormones, somatomedin,
erythropoietin, luteinizing hormone, chorionic gonadotropin,
hypothalamic releasing factors, antidiuretic hormones, thyroid
stimulating hormone, relaxin, interferon, thrombopoietin (TPO)
and prolactin.

24. The method of any of the claims 1 to 21, wherein said
starting parental templates represent a polynucleotide
containing a biological function such as transcription
initiation (promoter) or termination, translational initiation,
operator sites related to expression of genes.

25. The method of any of the claims 1 to 24, wherein said
number of cycles is less than 500, 200, or less than 100
cycles.

26. A method of identifying a polypeptide of interest
exhibiting improved properties in comparison to naturally
occurring or other known polypeptides of the same activity,
whereby a library of recombined homologous polynucleotides
produced by a process according to any of the claims 1 to 25 is
cloned into an appropriate vector, said vector is transformed
into a suitable host system, to be expressed into the
corresponding polypeptides and displayed, and said polypeptide is
screened in a suitable assay, and positive polypeptides selected.

27. A method for producing a polypeptide of interest as
identified according to claim 25, whereby a vector comprising a
polynucleotide encoding said identified polypeptide is
transformed into a suitable host, said host is grown to express
said polypeptide, and the polypeptide recovered and purified.